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Abstract 


This document discusses the applicability of the QUIC transport protocol, focusing on caveats 
impacting application protocol development and deployment over QUIC. Its intended audience is 
designers of application protocol mappings to QUIC and implementors of these application 
protocols. 


Status of This Memo 


This document is not an Internet Standards Track specification; it is published for informational 
purposes. 


This document is a product of the Internet Engineering Task Force (IETF). It represents the 
consensus of the IETF community. It has received public review and has been approved for 
publication by the Internet Engineering Steering Group (IESG). Not all documents approved by 
the IESG are candidates for any level of Internet Standard; see Section 2 of RFC 7841. 


Information about the current status ofthis document, any errata, and how to provide feedback 
on it may be obtained at https://www.rfc-editor.org/info/rfc9308. 


Copyright Notice 


Copyright (c) 2022 IETF Trust and the persons identified as the document authors. All rights 
reserved. 


This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF 
Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this 
document. Please review these documents carefully, as they describe your rights and restrictions 
with respect to this document. Code Components extracted from this document must include 
Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are 
provided without warranty as described in the Revised BSD License. 
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1. Introduction 


QUIC [QUIC] is a new transport protocol providing anumber of advanced features. While 
initially designed for the HTTP use case, it provides capabilities that can be used with a much 
wider variety of applications. QUIC is encapsulated in UDP. QUIC version 1 integrates TLS 1.3 
[TLS13] to encrypt all payload data and most control information. The version of HTTP that uses 
QUIC is known as HTTP/3 [QUIC-HTTP]. 


This document provides guidance for application developers who want to use the QUIC protocol 
without implementing it on their own. This includes general guidance for applications operating 
over HTTP/3 or directly over QUIC. 


In the following sections, we discuss specific caveats to QUIC's applicability and issues that 
application developers must consider when using QUIC as a transport for their applications. 


2. The Necessity of Fallback 


QUIC uses UDP as a substrate. This enables userspace implementation and permits traversal of 
network middleboxes (including NAT) without requiring updates to existing network 
infrastructure. 


Measurement studies have shown between 3% [Trammell16] and 5% [Swett16] of networks 
block all UDP traffic, though there is little evidence of other forms of systematic disadvantage to 
UDP traffic compared to TCP [Edeline16]. This blocking implies that all applications running on 
top of QUIC must either be prepared to accept connectivity failure on such networks or be 
engineered to fall back to some other transport protocol. In the case of HTTP, this fallback is TLS 
over TCP. 


The IETF Transport Services (TAPS) specifications [TAPS-ARCH] describe a system with a common 
API for multiple protocols. This is particularly relevant for QUIC as it addresses the implications 
of fallback among multiple protocols. 
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Specifically, fallback to insecure protocols or to weaker versions of secure protocols needs to be 
avoided. In general, an application that implements fallback needs to consider the security 
consequences. A fallback to TCP and TLS exposes control information to modification and 
manipulation in the network. Additionally, downgrades to TLS versions older than 1.3, which is 
used in QUIC version 1, might result in significantly weaker cryptographic protection. For 
example, the results of protocol negotiation [RFC7301] only have confidentiality protection if TLS 
1.3 is used. 


These applications must operate, perhaps with impaired functionality, in the absence of features 
provided by QUIC not present in the fallback protocol. For fallback to TLS over TCP, the most 
obvious difference is that TCP does not provide stream multiplexing, and therefore stream 
multiplexing would need to be implemented in the application layer if needed. Further, TCP 
implementations and network paths often do not support the TCP Fast Open (TFO) option 
[RFC7413], which enables sending of payload data together with the first control packet of a new 
connection as also provided by 0-RTT session resumption in QUIC. Note that there is some 
evidence of middleboxes blocking SYN data even if TFO was successfully negotiated (see 
[PaaschNanog]). And even if Fast Open successfully operates end to end, it is limited to a single 
packet of TLS handshake and application data, unlike QUIC 0-RTT. 


Moreover, while encryption (in this case TLS) is inseparably integrated with QUIC, TLS 
negotiation over TCP can be blocked. If TLS over TCP cannot be supported, the connection should 
be aborted, and the application then ought to present a suitable prompt to the user that secure 
communication is unavailable. 


In summary, any fallback mechanism is likely to impose a degradation of performance and can 
degrade security; however, fallback must not silently violate the application's expectation of 
confidentiality or integrity of its payload data. 


3. O-RTT 


QUIC provides for 0-RTT connection establishment. Though the same facility exists in TLS 1.3 
with TCP, 0-RTT presents opportunities and challenges for applications using QUIC. 


A transport protocol that provides 0-RTT connection establishment is qualitatively different from 
one that does not provide 0-RTT from the point of view of the application using it. Relative trade- 
offs between the cost of closing and reopening a connection and trying to keep it open are 
different; see Section 3.2. 


An application needs to deliberately choose to use O-RTT, as 0-RTT carries a risk of replay attack. 
Application protocols that use 0-RTT require a profile that describes the types of information that 
can be safely sent. For HTTP, this profile is described in [HTTP-REPLAY]. 

3.1. Replay Attacks 


Retransmission or malicious replay of data contained in 0-RTT packets could cause the server 
side to receive multiple copies of the same data. 
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Application data sent by the client in O-RTT packets could be processed more than once ifitis 
replayed. Applications need to be aware of what is safe to send in O-RTT. Application protocols 
that seek to enable the use of 0-RTT need a careful analysis and a description of what can be sent 
in 0-RTT; see Section 5.6 of [QUIC-TLS]. 


In some cases, it might be sufficient to limit application data sent in O-RTT to data that does not 
cause actions with lasting effects at a server. Initiating data retrieval or establishing 
configuration are examples of actions that could be safe. Idempotent operations -- those for 
which repetition has the same net effect as a single operation -- might be safe. However, it is also 
possible to combine individually idempotent operations into a non-idempotent sequence of 
operations. 


Once a server accepts 0-RTT data, there is no means of selectively discarding data that is 
received. However, protocols can define ways to reject individual actions that might be unsafe if 
replayed. 


Some TLS implementations and deployments might be able to provide partial or even complete 
replay protection, which could be used to manage replay risk. 


3.2. Session Resumption versus Keep-Alive 


Because QUIC is encapsulated in UDP, applications using QUIC must deal with short network idle 
timeouts. Deployed stateful middleboxes will generally establish state for UDP flows on the first 
packet sent and keep state for much shorter idle periods than for TCP. [RFC5382] suggests a TCP 
idle period of at least 124 minutes, though there is no evidence of widespread implementation of 
this guideline in the literature. However, short network timeout for UDP is well-documented. 
According to a 2010 study ([Hatonen10]), UDP applications can assume that any NAT binding or 
other state entry can expire after just thirty seconds of inactivity. Section 3.5 of [RFC8085] further 
discusses keep-alive intervals for UDP: it requires that there is a minimum value of 15 seconds, 
but recommends larger values, or that keep-alive is omitted entirely. 


By using a connection ID, QUIC is designed to be robust to NAT rebinding after a timeout. 
However, this only helps if one endpoint maintains availability at the address its peer uses and 
the peer is the one to send after the timeout occurs. 


Some QUIC connections might not be robust to NAT rebinding because the routing infrastructure 
(in particular, load balancers) uses the address/port 4-tuple to direct traffic. Furthermore, 
middleboxes with functions other than address translation could still affect the path. In 
particular, some firewalls do not admit server traffic for which the firewall has no recent state 
for a corresponding packet sent from the client. 


QUIC applications can adjust idle periods to manage the risk of timeout. Idle periods and the 
network idle timeout are distinct from the connection idle timeout, which is defined as the 
minimum of either endpoint's idle timeout parameter; see Section 10.1 of [QUIC]. There are three 
options: 


e Ignore the issue if the application-layer protocol consists only of interactions with no or very 
short idle periods or if the protocol's resistance to NAT rebinding is sufficient. 
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e Ensure there are no long idle periods. 
e Resume the session after a long idle period, using 0-RTT resumption when appropriate. 


The first strategy is the easiest, but it only applies to certain applications. 


Either the server or the client in a QUIC application can send PING frames as keep-alives to 
prevent the connection and any on-path state from timing out. Recommendations for the use of 
keep-alives are application specific, mainly depending on the latency requirements and message 
frequency of the application. In this case, the application mapping must specify whether the 
client or server is responsible for keeping the application alive. While [Hatonen10] suggests that 
30 seconds might be a suitable value for the public Internet when a NAT is on path, larger values 
are preferable if the deployment can consistently survive NAT rebinding or is known to be ina 
controlled environment (e.g., data centers) in order to lower network and computational load. 


Sending PING frames more frequently than every 30 seconds over long idle periods may result in 
excessive unproductive traffic in some situations and unacceptable power usage for power- 
constrained (mobile) devices. Additionally, timeouts shorter than 30 seconds can make it harder 
to handle transient network interruptions, such as Virtual Machine (VM) migration or coverage 
loss during mobility. See [RFC8085], especially Section 3.5. 


Alternatively, the client (but not the server) can use session resumption instead of sending keep- 
alive traffic. In this case, a client that wants to send data to a server over a connection that has 
been idle longer than the server's idle timeout (available from the idle_timeout transport 
parameter) can simply reconnect. When possible, this reconnection can use 0-RTT session 
resumption, reducing the latency involved with restarting the connection. Of course, this 
approach is only valid in cases in which it is safe to use O-RTT and when the client is the 
restarting peer. 


The trade-offs between resumption and keep-alives need to be evaluated on a per-application 
basis. In general, applications should use keep-alives only in circumstances where continued 
communication is highly likely; [QUIC-HTTP], for instance, recommends using keep-alives only 
when a request is outstanding. 


4. Use of Streams 


QUIC's stream multiplexing feature allows applications to run multiple streams over a single 
connection without head-of-line blocking between streams. Stream data is carried within frames 
where one QUIC packet on the wire can carry one or multiple stream frames. 


Streams can be unidirectional or bidirectional, and a stream may be initiated either by client or 
server. Only the initiator of a unidirectional stream can send data on it. 

Streams and connections can each carry a maximum of 2521 bytes in each direction due to 
encoding limitations on stream offsets and connection flow control limits. In the presently 
unlikely event that this limit is reached by an application, a new connection would need to be 
established. 
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Streams can be independently opened and closed, gracefully or abruptly. An application can 
gracefully close the egress direction of a stream by instructing QUIC to send a FIN bit ina 
STREAM frame. It cannot gracefully close the ingress direction without a peer-generated FIN, 
much like in TCP. However, an endpoint can abruptly close the egress direction or request that its 
peer abruptly close the ingress direction; these actions are fully independent of each other. 


QUIC does not provide an interface for exceptional handling of any stream. If a stream that is 
critical for an application is closed, the application can generate error messages on the 
application layer to inform the other end and/or the higher layer, which can eventually 
terminate the QUIC connection. 


Mapping of application data to streams is application specific and described for HTTP/3 in [QUIC- 
HTTP]. There are a few general principles to apply when designing an application's use of 
streams: 


e A single stream provides ordering. If the application requires certain data to be received in 
order, that data should be sent on the same stream. There is no guarantee of transmission, 
reception, or delivery order across streams. 


e Multiple streams provide concurrency. Data that can be processed independently, and 
therefore would suffer from head-of-line blocking if forced to be received in order, should be 
transmitted over separate streams. 


e Streams can provide message orientation and allow messages to be canceled. If one message 
is mapped to a single stream, resetting the stream to expire an unacknowledged message can 
be used to emulate partial reliability for that message. 


If a QUIC receiver has opened the maximum allowed concurrent streams, and the sender 
indicates that more streams are needed, it does not automatically lead to an increase of the 
maximum number of streams by the receiver. Therefore, an application should consider the 
maximum number of allowed, currently open, and currently used streams when determining 
how to map data to streams. 


QUIC assigns a numerical identifier, called the stream ID, to each stream. While the relationship 
between these identifiers and stream types is clearly defined in version 1 of QUIC, future 
versions might change this relationship for various reasons. QUIC implementations should 
expose the properties of each stream (which endpoint initiated the stream, whether the stream is 
unidirectional or bidirectional, the stream ID used for the stream); applications should query for 
these properties rather than attempting to infer them from the stream ID. 


The method of allocating stream identifiers to streams opened by the application might vary 
between transport implementations. Therefore, an application should not assume a particular 
stream ID will be assigned to a stream that has not yet been allocated. For example, HTTP/3 uses 
stream IDs to refer to streams that have already been opened but makes no assumptions about 
future stream IDs or the way in which they are assigned (see Section 6 of [QUIC-HTTP]). 
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4.1. Stream versus Flow Multiplexing 


Streams are meaningful only to the application; since stream information is carried inside QUIC's 
encryption boundary, a given packet exposes no information about which stream(s) are carried 
within the packet. Therefore, stream multiplexing is not intended to be used for differentiating 
streams in terms of network treatment. Application traffic requiring different network treatment 
should therefore be carried over different 5-tuples (i.e., multiple QUIC connections). Given QUIC's 
ability to send application data in the first RTT of a connection (if a previous connection to the 
same host has been successfully established to provide the necessary credentials), the cost of 
establishing another connection is extremely low. 


4.2. Prioritization 


Stream prioritization is not exposed to either the network or the receiver. Prioritization is 
managed by the sender, and the QUIC transport should provide an interface for applications to 
prioritize streams [QUIC]. Applications can implement their own prioritization scheme on top of 
QUIC: an application protocol that runs on top of QUIC can define explicit messages for signaling 
priority, such as those defined in [RFC9218] for HTTP. An application protocol can define rules 
that allow an endpoint to determine priority based on context or can provide a higher-level 
interface and leave the determination to the application on top. 


Priority handling of retransmissions can be implemented by the sender in the transport layer. 
[QUIC] recommends retransmitting lost data before new data, unless indicated differently by the 
application. When a QUIC endpoint uses fully reliable streams for transmission, prioritization of 
retransmissions will be beneficial in most cases, filling in gaps and freeing up the flow control 
window. For partially reliable or unreliable streams, priority scheduling of retransmissions over 
data of higher-priority streams might not be desirable. For such streams, QUIC could either 
provide an explicit interface to control prioritization or derive the prioritization decision from 
the reliability level of the stream. 


4.3. Ordered and Reliable Delivery 


QUIC streams enable ordered and reliable delivery. Though it is possible for an implementation 
to provide options that use streams for partial reliability or out-of-order delivery, most 
implementations will assume that data is reliably delivered in order. 


Under this assumption, an endpoint that receives stream data might not make forward progress 
until data that is contiguous with the start of a stream is available. In particular, a receiver might 
withhold flow control credit until contiguous data is delivered to the application; see Section 2.2 
of [QUIC]. To support this receive logic, an endpoint will send stream data until it is 
acknowledged, ensuring that data at the start of the stream is sent and acknowledged first. 


An endpoint that uses a different sending behavior and does not negotiate that change with its 
peer might encounter performance issues or deadlocks. 
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4.4. Flow Control Deadlocks 


QUIC flow control (Section 4 of [QUIC]) provides a means of managing access to the limited 
buffers that endpoints have for incoming data. This mechanism limits the amount of data that 
can be in buffers in endpoints or in transit on the network. However, there are several waysin 
which limits can produce conditions that can cause a connection to either perform suboptimally 
or become deadlocked. 


Deadlocks in flow control are possible for any protocol that uses QUIC, though whether they 
become a problem depends on how implementations consume data and provide flow control 
credit. Understanding what causes deadlocking might help implementations avoid deadlocks. 


The size and rate of updates to flow control credit can affect performance. Applications that use 
QUIC often have a data consumer that reads data from transport buffers. Some implementations 
might have independent receive buffers at the transport layer and application layer. Consuming 
data does not always imply it is immediately processed. However, a common implementation 
technique is to extend flow control credit to the sender by emitting MAX_DATA and/or 
MAX_STREAM_DATA frames as data is consumed. Delivery of these frames is affected by the 
latency of the back channel from the receiver to the data sender. If credit is not extended in a 
timely manner, the sending application can be blocked, effectively throttling the sender. 


Large application messages can produce deadlocking if the recipient does not read data from the 
transport incrementally. If the message is larger than the flow control credit available and the 
recipient does not release additional flow control credit until the entire message is received and 
delivered, a deadlock can occur. This is possible even where stream flow control limits are not 
reached because connection flow control limits can be consumed by other streams. 


A length-prefixed message format makes it easier for a data consumer to leave data unread in 
the transport buffer and thereby withhold flow control credit. If flow control limits prevent the 
remainder of a message from being sent, a deadlock will result. A length prefix might also enable 
the detection of this sort of deadlock. Where application protocols have messages that might be 
processed as a single unit, reserving flow control credit for the entire message atomically makes 
this style of deadlock less likely. 


A data consumer can eagerly read all data as it becomes available in order to make the receiver 
extend flow control credit and reduce the chances of a deadlock. However, such a data consumer 
might need other means for holding a peer accountable for the additional state it keeps for 
partially processed messages. 


Deadlocking can also occur if data on different streams is interdependent. Suppose that data on 
one stream arrives before the data on a second stream on which it depends. A deadlock can 
occur if the first stream is left unread, preventing the receiver from extending flow control credit 
for the second stream. To reduce the likelihood of deadlock for interdependent data, the sender 
should ensure that dependent data is not sent until the data it depends on has been accounted 
for in both stream- and connection-level flow control credit. 
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Some deadlocking scenarios might be resolved by canceling affected streams with 
STOP_SENDING or RESET_STREAM. Canceling some streams results in the connection being 
terminated in some protocols. 


4.5. Stream Limit Commitments 


QUIC endpoints are responsible for communicating the cumulative limit of streams they would 
allow to be opened by their peer. Initial limits are advertised using the initial_max_streams_bidi 
and initial_max_streams_uni transport parameters. As streams are opened and closed, they are 
consumed, and the cumulative total is incremented. Limits can be increased using the 
MAX_STREAMS frame, but there is no mechanism to reduce limits. Once stream limits are 
reached, no more streams can be opened, which prevents applications using QUIC from making 
further progress. At this stage, connections can be terminated via idle timeout or explicit close; 
see Section 10. 


An application that uses QUIC and communicates a cumulative stream limit might require the 
connection to be closed before the limit is reached, e.g., to stop the server in order to perform 
scheduled maintenance. Immediate connection close causes abrupt closure of actively used 
streams. Depending on how an application uses QUIC streams, this could be undesirable or 
detrimental to behavior or performance. 


A more graceful closure technique is to stop sending increases to stream limits and allow the 
connection to naturally terminate once remaining streams are consumed. However, the period 
oftime it takes to do so is dependent on the peer, and an unpredictable closing period might not 
fit application or operational needs. Applications using QUIC can be conservative with open 
stream limits in order to reduce the commitment and indeterminism. However, being overly 
conservative with stream limits affects stream concurrency. Balancing these aspects can be 
specific to applications and their deployments. 


Instead of relying on stream limits to avoid abrupt closure, an application layer's graceful close 
mechanism can be used to communicate the intention to explicitly close the connection at some 
future point. HTTP/3 provides such a mechanism using the GOAWAY frame. In HTTP/3, when the 
GOAWAY frame is received by a client, it stops opening new streams even ifthe cumulative 
stream limit would allow. Instead, the client would create a new connection on which to open 
further streams. Once all streams are closed on the old connection, it can be terminated safely by 
a connection close or after expiration of the idle timeout (see Section 10). 


5. Packetization and Latency 


QUIC exposes an interface that provides multiple streams to the application; however, the 
application usually cannot control how data transmitted over those streams is mapped into 
frames or how those frames are bundled into packets. 


By default, many implementations will try to pack STREAM frames from one or more streams 
into each QUIC packet, in order to minimize bandwidth consumption and computational costs 
(see Section 13 of [QUIC]). If there is not enough data available to fill a packet, an implementation 
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might wait for a short time to optimize bandwidth efficiency instead of latency. This delay can 
either be preconfigured or dynamically adjusted based on the observed sending pattern of the 
application. 


If the application requires low latency, with only small chunks of data to send, it may be valuable 
to indicate to QUIC that all data should be sent out immediately. Alternatively, if the application 
expects to use a specific sending pattern, it can also provide a suggested delay to QUIC for how 
long to wait before bundling frames into a packet. 


Similarly, an application usually has no control over the length of a QUIC packet on the wire. 
QUIC provides the ability to add a PADDING frame to arbitrarily increase the size of packets. 
Padding is used by QUIC to ensure that the path is capable of transferring datagrams of at least a 
certain size during the handshake (see Sections 8.1 and 14.1 of [QUIC]) and for path validation 
after connection migration (see Section 8.2 of [QUIC]) as well as for Datagram Packetization 
Layer PMTU Discovery (DPLPMTUD) (see Section 14.3 of [QUIC]). 


Padding can also be used by an application to reduce leakage of information about the data that 
is sent. A QUIC implementation can expose an interface that allows an application layer to 
specify how to apply padding. 


6. Error Handling 


QUIC recommends that endpoints signal any detected errors to the peer. Errors can occur at the 
transport layer and the application layer. Transport errors, such as a protocol violation, affect the 
entire connection. Applications that use QUIC can define their own error detection and signaling 
(see, for example, Section 8 of [QUIC-HTTP]). Application errors can affect an entire connection 
or a single stream. 


QUIC defines an error code space that is used for error handling at the transport layer. QUIC 
encourages endpoints to use the most specific code, although any applicable code is permitted, 
including generic ones. 


Applications using QUIC define an error code space that is independent of QUIC or other 
applications (see, for example, Section 8.1 of [QUIC-HTTP]). The values in an application error 
code space can be reused across connection-level and stream-level errors. 


Connection errors lead to connection termination. They are signaled using a 
CONNECTION_CLOSE frame, which contains an error code and a reason field that can be zero 
length. Different types of CONNECTION_CLOSE frames are used to signal transport and 
application errors. 


Stream errors lead to stream termination. These are signaled using STOP_SENDING or 
RESET_STREAM frames, which contain only an error code. 
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7. Acknowledgment Efficiency 


QUIC version 1 without extensions uses an acknowledgment strategy adopted from TCP (see 
Section 13.2 of [QUIC]). That is, it recommends that every other packet is acknowledged. 
However, generating and processing QUIC acknowledgments consumes resources at a sender 
and receiver. Acknowledgments also incur forwarding costs and contribute to link utilization, 
which can impact performance over some types of network. Applications might be able to 
improve overall performance by using alternative strategies that reduce the rate of 
acknowledgments. [QUIC-ACK-FREQUENCY] describes an extension to signal the desired delay of 
acknowledgments and discusses use cases as well as implications for congestion control and 
recovery. 


8. Port Selection and Application Endpoint Discovery 


In general, port numbers serve two purposes: "first, they provide a demultiplexing identifier to 
differentiate transport sessions between the same pair of endpoints, and second, they may also 
identify the application protocol and associated service to which processes connect" (Section 3 of 
[RFC6335]). The assumption that an application can be identified in the network based on the 
port number is less true today due to encapsulation and mechanisms for dynamic port 
assignments, as noted in [RFC6335]. 


As QUIC is a general-purpose transport protocol, there are no requirements that servers use a 
particular UDP port for QUIC. For an application with a fallback to TCP that does not already 
have an alternate mapping to UDP, it is usually appropriate to register (if necessary) and use the 
UDP port number corresponding to the TCP port already registered for the application. For 
example, the default port for HTTP/3 [QUIC-HTTP] is UDP port 443, analogous to HTTP/1.1 or 
HTTP/2 over TLS over TCP. 


Given the prevalence of the assumption in network management practice that a port number 
maps unambiguously to an application, the use of ports that cannot easily be mapped to a 
registered service name might lead to blocking or other changes to the forwarding behavior by 
network elements such as firewalls that use the port number for application identification. 


Applications could define an alternate endpoint discovery mechanism to allow the usage of ports 
other than the default. For example, HTTP/3 (Sections 3.2 and 3.3 of [QUIC-HTTP]) specifies the 
use of HTTP Alternative Services [RFC7838] for an HTTP origin to advertise the availability of an 
equivalent HTTP/3 endpoint on a certain UDP port by using "h3" as the Application-Layer 
Protocol Negotiation (ALPN) [RFC7301] token. 


ALPN permits the client and server to negotiate which of several protocols will be used on a 
given connection. Therefore, multiple applications might be supported on a single UDP port 
based on the ALPN token offered. Applications using QUIC are required to register an ALPN 
token for use in the TLS handshake. 
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As QUIC version 1 deferred defining a complete version negotiation mechanism, HTTP/3 requires 
QUIC version 1 and defines the ALPN token ("h3") to only apply to that version. So far, no single 
approach has been selected for managing the use of different QUIC versions, neither in HTTP/3 
nor in general. Application protocols that use QUIC need to consider how the protocol will 
manage different QUIC versions. Decisions for those protocols might be informed by choices 
made by other protocols, like HTTP/3. 


8.1. Source Port Selection 


Some UDP protocols are vulnerable to reflection attacks, where an attacker is able to direct traffic 
to a third party as a denial of service. For example, these source ports are associated with 
applications known to be vulnerable to reflection attacks, often due to server misconfiguration: 


port 53 - DNS [RFC1034] 

e port 123 - NTP [RFC5905] 

e port 1900 - SSDP [SSDP] 

e port 5353 - mDNS [RFC6762] 
e port 11211 - memcache 


Services might block source ports associated with protocols known to be vulnerable to reflection 
attacks to avoid the overhead of processing large numbers of packets. However, this practice has 
negative effects on clients -- not only does it require establishment of a new connection but in 
some instances might cause the client to avoid using QUIC for that service for a period of time 
and downgrade to a non-UDP protocol (see Section 2). 


As a result, client implementations are encouraged to avoid using source ports associated with 
protocols known to be vulnerable to reflection attacks. Note that following the general guidance 
for client implementations given in [RFC6335], to use ephemeral ports in the range 49152-65535, 
has the effect of avoiding these ports. Note that other source ports might be reflection vectors as 
well. 


9. Connection Migration 


QUIC supports connection migration by the client. If the client's IP address changes, a QUIC 
endpoint can still associate packets with an existing transport connection using the Destination 
Connection ID field (see Section 11) in the QUIC header. This supports cases where the address 
information changes, such as NAT rebinding, the intentional change of the local interface, the 
expiration of a temporary IPv6 address [RFC8981], or the indication from the server of a 
preferred address (Section 9.6 of [QUIC]). 


Use of a non-zero-length connection ID for the server is strongly recommended if any clients are 
or could be behind a NAT. A non-zero-length connection ID is also strongly recommended when 
active migration is supported. If a connection is intentionally migrated to a new path, a new 
connection ID is used to minimize linkability by network observers. The other QUIC endpoint 
uses the connection ID to link different addresses to the same connection and entity if a non- 
zero-length connection ID is provided. 
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The base specification of QUIC version 1 only supports the use of a single network path at a time, 
which enables failover use cases. Path validation is required so that endpoints validate paths 
before use to avoid address spoofing attacks. Path validation takes at least one RTT, and 
congestion control will also be reset after path migration. Therefore, migration usually has a 
performance impact. 


QUIC probing packets, which can be sent on multiple paths at once, are used to perform address 
validation as well as measure path characteristics. Probing packets cannot carry application data 
but likely contain padding frames. Endpoints can use information about their receipt as input to 
congestion control for that path. Applications could use information learned from probing to 
inform a decision to switch paths. 


Only the client can actively migrate in version 1 of QUIC. However, servers can indicate during 
the handshake that they prefer to transfer the connection to a different address after the 
handshake. For instance, this could be used to move from an address that is shared by multiple 
servers to an address that is unique to the server instance. The server can provide an IPv4 and 
an IPv6 address in a transport parameter during the TLS handshake, and the client can select 
between the two ifboth are provided. See Section 9.6 of [QUIC]. 


10. Connection Termination 


QUIC connections are terminated in one of three ways: implicit idle timeout, explicit immediate 
close, or explicit stateless reset. 


QUIC does not provide any mechanism for graceful connection termination; applications using 
QUIC can define their own graceful termination process (see, for example, Section 5.2 of [QUIC- 
HTTP)). 


QUIC idle timeout is enabled via transport parameters. The client and server announce a timeout 
period, and the effective value for the connection is the minimum of the two values. After the 
timeout period elapses, the connection is silently closed. An application therefore should be able 
to configure its own maximum value, as well as have access to the computed minimum value for 
this connection. An application may adjust the maximum idle timeout for new connections based 
on the number of open or expected connections since shorter timeout values may free up 
resources more quickly. 


Application data exchanged on streams or in datagrams defers the QUIC idle timeout. 
Applications that provide their own keep-alive mechanisms will therefore keep a QUIC 
connection alive. Applications that do not provide their own keep-alive can use transport-layer 
mechanisms (see Section 10.1.2 of [QUIC] and Section 3.2). However, QUIC implementation 
interfaces for controlling such transport behavior can vary, affecting the robustness of such 
approaches. 


An immediate close is signaled by a CONNECTION_CLOSE frame (see Section 6). Immediate close 
causes all streams to become immediately closed, which may affect applications; see Section 4.5. 
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A stateless reset is an option of last resort for an endpoint that does not have access to 
connection state. Receiving a stateless reset is an indication of an unrecoverable error distinct 
from connection errors in that there is no application-layer information provided. 


11. Information Exposure and the Connection ID 


QUIC exposes some information to the network in the unencrypted part ofthe header either 
before the encryption context is established or because the information is intended to be used by 
the network. For more information on manageability of QUIC, see [QUIC-MANAGEABILITY]. QUIC 
has a long header that exposes some additional information (the version and the source 
connection ID), while the short header exposes only the destination connection ID. In QUIC 
version 1, the long header is used during connection establishment, while the short header is 
used for data transmission in an established connection. 


The connection ID can be zero length. Zero-length connection IDs can be chosen on each 
endpoint individually and on any packet except the first packets sent by clients during 
connection establishment. 


An endpoint that selects a zero-length connection ID will receive packets with a zero-length 
destination connection ID. The endpoint needs to use other information, such as the source and 
destination IP address and port number to identify which connection is referred to. This could 
mean that the endpoint is unable to match datagrams to connections successfully if these values 
change, making the connection effectively unable to survive NAT rebinding or migrate to a new 
path. 


11.1. Server-Generated Connection ID 


QUIC supports a server-generated connection ID that is transmitted to the client during 
connection establishment (see Section 7.2 of [QUIC]). Servers behind load balancers may need to 
change the connection ID during the handshake, encoding the identity of the server or 
information about its load balancing pool, in order to support stateless load balancing. 


Server deployments with load balancers and other routing infrastructure need to ensure that 
this infrastructure consistently routes packets to the server instance that has the connection 
state, even if addresses, ports, or connection IDs change. This might require coordination 
between servers and infrastructure. One method of achieving this involves encoding routing 
information into the connection ID. For an example of this technique, see [QUIC-LB]. 


11.2. Mitigating Timing Linkability with Connection ID Migration 


If QUIC endpoints do not issue fresh connection IDs, then clients cannot reduce the linkability of 
address migration by using them. Choosing values that are unlinkable to an outside observer 
ensures that activity on different paths cannot be trivially correlated using the connection ID. 


While sufficiently robust connection ID generation schemes will mitigate linkability issues, they 
do not provide full protection. Analysis of the lifetimes of 6-tuples (source and destination 
addresses as well as the migrated Connection ID) may expose these links anyway. 
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In the case where connection migration in a server pool is rare, it is trivial for an observer to 
associate two connection IDs. Conversely, where every server handles multiple simultaneous 
migrations, even an exposed server mapping may be insufficient information. 


The most efficient mitigations for these attacks are through network design and/or operational 
practices, by using a load-balancing architecture that loads more flows onto a single server-side 
address, by coordinating the timing of migrations in an attempt to increase the number of 
simultaneous migrations at a given time, or by using other means. 


11.3. Using Server Retry for Redirection 


QUIC provides a Retry packet that can be sent by a server in response to the client Initial packet. 
The server may choose a new connection ID in that packet, and the client will retry by sending 
another client Initial packet with the server-selected connection ID. This mechanism can be used 
to redirect a connection to a different server, e.g., due to performance reasons or when servers in 
a server pool are upgraded gradually and therefore may support different versions of QUIC. 


In this case, it is assumed that all servers belonging to a certain pool are served in cooperation 
with load balancers that forward the traffic based on the connection ID. A server can choose the 
connection ID in the Retry packet such that the load balancer will redirect the next Initial packet 
to a different server in that pool. Alternatively, the load balancer can directly offer a Retry 
offload as further described in [QUIC-RETRY]. 


The approach described in Section 4 of [RFC5077] for constructing TLS resumption tickets 
provides an example that can be also applied to validation tokens. However, the use of more 
modern cryptographic algorithms is highly recommended. 


12. Quality of Service (QoS) and Diffserv Code Point (DSCP) 


QUIC, as defined in [QUIC], has a single congestion controller and recovery handler. This design 
assumes that all packets of a QUIC connection, or at least with the same 5-tuple {dest addr, source 
addr, protocol, dest port, source port}, that have the same Diffserv Code Point (DSCP) [RFC2475] 
will receive similar network treatment since feedback about loss or delay of each packet is used 
as input to the congestion controller. Therefore, packets belonging to the same connection should 
use a single DSCP. Section 5.1 of [RFC7657] provides a discussion of Diffserv interactions with 
datagram transport protocols [RFC7657] (in this respect, the interactions with QUIC resemble 
those of Stream Control Transmission Protocol (SCTP)). 


When multiplexing multiple flows over a single QUIC connection, the selected DSCP value should 
be the one associated with the highest priority requested for all multiplexed flows. 


If differential network treatment is desired, e.g., by the use of different DSCPs, multiple QUIC 
connections to the same server may be used. In general, it is recommended to minimize the 
number of QUIC connections to the same server to avoid increased overhead and, more 
importantly, competing congestion control. 
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As in other uses of Diffserv, when a packet enters a network segment that does not support the 
DSCP value, this could result in the connection not receiving the network treatment it expects. 
The DSCP value in this packet could also be remarked as the packet travels along the network 
path, changing the requested treatment. 


13. Use of Versions and Cryptographic Handshake 


Versioning in QUIC may change the protocol's behavior completely, except for the meaning of a 
few header fields that have been declared to be invariant [QUIC-INVARIANTS]. A version of QUIC 
with a higher version number will not necessarily provide a better service but might simply 
provide a different feature set. As such, an application needs to be able to select which versions 
of QUIC it wants to use. 


A new version could use an encryption scheme other than TLS 1.3 or higher. [QUIC] specifies 
requirements for the cryptographic handshake as currently realized by TLS 1.3 and described in 
a separate specification [QUIC-TLS]. This split is performed to enable lightweight versioning with 
different cryptographic handshakes. 


The "QUIC Versions" registry established in [QUIC] allows for provisional registrations for 
experimentation. Registration, also of experimental versions, is important to avoid collision. 
Experimental versions should not be used long-term or registered as permanent to minimize the 
risk of fingerprinting based on the version number. 


14. Enabling Deployment of New Versions 


QUIC version 1 does not specify a version negotiation mechanism in the base specification, but 
[QUIC-VERSION-NEGOTIATION] proposes an extension that provides compatible version 
negotiation. 


This approach uses a three-stage deployment mechanism, enabling progressive rollout and 
experimentation with multiple versions across a large server deployment. In this approach, all 
servers in the deployment must accept connections using a new version (stage 1) before any 
server advertises it (stage 2), and authentication of the new version (stage 3) only proceeds after 
advertising of that version is completely deployed. 


See Section 5 of [QUIC-VERSION-NEGOTIATION] for details. 


15. Unreliable Datagram Service over QUIC 


[RFC9221] specifies a QUIC extension to enable sending and receiving unreliable datagrams over 
QUIC. Unlike operating directly over UDP, applications that use the QUIC datagram service do not 
need to implement their own congestion control, per [RFC8085], as QUIC datagrams are 
congestion controlled. 
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QUIC datagrams are not flow controlled, and as such data chunks may be dropped ifthe receiver 
is overloaded. While the reliable transmission service of QUIC provides a stream-based interface 
to send and receive data in order over multiple QUIC streams, the datagram service has an 
unordered message-based interface. If needed, an application-layer framing can be used on top 
to allow separate flows of unreliable datagrams to be multiplexed on one QUIC connection. 


16. IANA Considerations 


This document has no actions for IANA; however, note that Section 8 recommends that an 
application that has already registered a TCP port but wants to specify QUIC as a transport 
should register a UDP port analogous to their existing TCP registration. 


17. Security Considerations 


See the security considerations in [QUIC] and [QUIC-TLS]; the security considerations for the 
underlying transport protocol are relevant for applications using QUIC. Considerations on 
linkability, replay attacks, and randomness discussed in [QUIC-TLS] should be taken into account 
when deploying and using QUIC. 


Further, migration to a new address exposes a linkage between client addresses to the server and 
may expose this linkage also to the path if the connection ID cannot be changed or flows can 
otherwise be correlated. When migration is supported, this needs to be considered with 
respective to user privacy. 


Application developers should note that any fallback they use when QUIC cannot be used due to 
network blocking of UDP should guarantee the same security properties as QUIC. If this is not 
possible, the connection should fail to allow the application to explicitly handle fallback to a less- 
secure alternative. See Section 2. 


Further, [QUIC-HTTP] provides security considerations specific to HTTP. However, discussions 
such as on cross-protocol attacks, traffic analysis and padding, or migration might be relevant for 
other applications using QUIC as well. 
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